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Title: Improvements in or Relating to Gene Expression 



Field of the Invention 

This invention relates to a nucleic acid sequence opcLmised for expression in a plant cell, 
and to vectors and plant cells comprising the nucleic acid sequence. 

Background of the Invention 

In yeast there is a gene called GAL4, the product of which acts as a transcriptional 
activator (Johnston 1987 Microbiol. Rev. 51, 458-476). The GAL4 gene is quite large 
(3Kb) and the encoded polypeptide comprises an N-terminal DNA-binding domain, a C 
terminal domain having a transcriptional activator function, and an intervening glucose- 
responsive element "GRE" (GAL4 being repressed in the presence of glucose). 

The GAL4 protein has been quite well characterised. Amino acid residues 1-147 of the 
protein bind to DNA in a sequence-specific manner (Keegan et al 1986 Science 231, 699- 
704). The transcriptional activation function is associated with two short ponions of the 
C terminal domain (Ma & Ptashne 1987 Cell 48, 847-853). The DNA sequence to which 
GAL4 binds has been identified as a 17mer, which must be present as a repeat for optimal 
GAL4 binding (Giniger et a/,_1985 Cell 40, 767-774), and may he referred to as a GAlA-~ 
responsive "upstream activation sequence" (UAS): GAL4 binds to the UAS 5" of a gene 
by means of the DNA-binding domain, and the C-terminal domain causes up-regulation 
of transcription of the gene. 

Recently, a two-element system has been developed for directing gene expression in 
Drosophila melanogaster. Brand and Perrimon (1993 Development 118 : 401-415,) 
randomly inserted the gene encoding the GAL4 into the Drosophila genome using a P- 
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element based vector. 

A large number of stable Drosophila lines were generated which each express GAL4 in 
a particular pattern, dependent on adjacent genomic DNA sequences. A chosen target 
gene could then be cloned under the control of GAL4 upstream activation sequences 
(UAS), separately transformed, and maintained silently in the absence of GAL4. Genetic 
crossing of this single line with any of the library of GAL4-containing lines allowed 
activation of the target gene in many different tissue and cell types, and the phenotypic 
consequences of mis-expression, including those lethal to the organism, could be 
conveniently studied. In addition, the library of GAL4-containing flies has become an 
increasingly characterised and shared resource, and which provides a common entry point 
for various "reverse" genetic techniques. 

A similar system operable in plants would be highly desirable. However, expression of 
heterologous eukaryotic genes in plants has proved highly problematical in the past (see, 
for example. Green Fluorescent Protein) and efficient expression of GAL4 appears equally 
difficult, and it has been suggested that this is due to inefficient translation of GAL4 
mRNA in plants (Reichel et al, 1995 Plant Cell Reports 14. 773-776). Ma et al, (1988 
Nature 334 . 631-633) were able to obtain transient expression of modified, functional 
GAL4 derivatives in tobacco-leaf protoplasts but could not demonstrate the presence of 
functional, full-length wild-type GAL4. Similarly, transient expression of GAL4 
derivatives has been demonstrated in maize protoplasts (McCarty et al, 1991 Cell 66, 895- 
905) and when introduced by biolistic methods, in maize aleurone tissues or embryogenic 
calli {Goffet al, 1991 Genes & Dev. 5, 298-309; Goffer a/, 1992 Genes & Dev. 6, 864- 
875). Hitherto however, there have been no reports of stable, efficient expression of 
functional GAL4 or derivaties thereof in a plant cell. 

It is an aim of the present invention to provide a novel expression system operable in 
plants, utilising a modified portion of the GAL4 gene. 

Summary of the Invention 
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In a first aspect the invention provides a nucleic acid sequence, expressible in a plant ceil 
encoding at least an effective portion of a GAL4 DNA-binding domain, the sequence 
having an A/T base content substantially reduced relative to the wild-type yeast sequence. 

The A/T content of the wild-type yeast sequence encoding the DNA-binding domain of 
GAL4 is about 59%. The % A/T base content of the sequence of the invention encoding 
the effective ponion of the GAL4 DNA-binding domain will be understood to be 
_ substantially^ reduced, when it is less than 50%, -Preferably the A/T content is less than 
45%, and more preferably less than 40%. The sequence of the invention may be made, 
for example, by site-directed mutagenesis, or be made de novo by chemical synthesis. 

An "effective portion" of the DNA-binding domain is a ponion sufficient to retain most 
(i.e. over 50%) of the DNA-binding activity of the full length DNA-binding domain. 
Typically the "effective portion" will comprise at least two thirds of the full length 
sequence of the DNA-binding domain. Conveniently the nucleic acid sequence will 
encode substantially all of the GAL4 DNA-binding domain (i.e. amino acid residues 1-147 
of the yeast polypeptide), although a substantially smaller portion (about 75 amino acid 
residue) is quite adequate to retain most of the DNA-binding activity (Kraulis et al, 1992 
Nature 356, 448-450; and Marmorstein et al, 1992 Nature 356, 408-414), In one 
panicular embodiment, the sequence will comprise the nucleotide sequence shown in the 
5' ponion (nucleotides 1 to about 460) of Figure 1, which sequence has a substantially 
reduced A/T content (38%), relative to the wild-type yeast sequence, yet encodes an 
identical amino acid sequence. 



The GAL4_DNA2binding domain, has,^ no transcriptionaL activation function-in its own 
right. Thus, in preferred embodiments, the sequence encoding the effective portion of the 
GAL4 DNA-binding domain will be operably linked to one or more other nucleic acid 
sequences, which sequences may be structural (i.e. encode functional polypeptides) and/or 
regulatory. Typically the sequence encoding the DNA-binding domain will be operably 
linked (e.g. fused in-frame) to a sequence encoding a peptide or polypeptide with a 
regulatory function, preferably a transcriptional activator. The transcriptional activator 
may be the activation domain of GAL4 protein, the sequence encoding which should 
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preferably be optimised for expression in plants (e.g. by reducing the A/T content 
thereof). Alternatively, the transcriptional activator could be any one of a number of such 
proteins known to be active in plants, which will be well-known to those skilled in the art, 
such that the sequence of the invention encodes a chimeric polypeptide, comprising at least 
an effective portion of the GAL4 DNA-binding domain and a transcriptional activation 
domain. A particularly suitable transcriptional activator domain is that obtainable from 
herpes simplex virus (HSV) VP-16, (see Greaves and O'Hare 1989 J. Virol 6i, 1641- 
1650; VP. being also known as VMW65). Other transcriptional activation domains 
include certain peptides encoded by E. coli genomic DNA fragments (Ma & Ptashne 1987 
Cell 51, 113-119) or synthetic peptides designed to form amphiphilic a-helix (Giniger & 
Ptashne 1987 Nature 330, 670-672). A common feature appears to be the requirement for 
excess charge, either positive (Gill & Ptashne 1987 Cell 51, 121-126) or, more especially 
for plant activation domains, excess negative charge (Estruch etal, 1994 NucL Acids Res. 
22, 3983-3989). With this in mind, the person skilled in the art could readily synthesise, 
or fmd naturally-occurring sequences, which encode peptides or polypeptides with 
transcriptional activation activity in plants. Such activator sequences may, in any event, 
be modified for optimal activity in a plant cell. 

In a further aspect the invention provides a nucleic acid construct, comprising the nucleic 
acid sequence defined above. In one particular embodiment, the nucleic acid construct 
could be used as an "enhancer-trap" to "fish" for plant enhancer sequences (Cf. 
Sundaresan et al, 1995 Genes & Dev. 9, 1797-1810). In such an embodiment, the 
construct will preferably include right and left Ti-DNA, to allow for random, stable 
insertion into the genome of a plant cell host. The construct will preferably comprise a 
naive plant promoter sequence, by which term is meant a promoter (operable in a plant 
cell) which requires the presence of a suitable enhancer sequence to cause substantial 
levels of transcription. Such a "naive" promoter may be thought of as an "enhancer- 
dependent" promoter, and essentially corresponds to the TATA box region of known plant 
promoters. "Plant" promoters includes reference to viral and bacterial promoter sequences 
which are active in a plant cell (e.g. nucleotides -48 to +1 of the CaMV 35S promoter). 
The naive promoter is conveniently substantially adjacent to one Ti border, the promoter 
being in competent relationship with the sequence encoding the GAL4 DNA-binding 
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domain fused with a transcription activation domain operable in plants such that, should 
the promoter become insened in the plant host cell genome in functional relationship with 
an enhancer sequence, the promoter will direct the expression of the GAL4 DNA-binding 
domain and the transcriptional activation domain. The constmct may additionally 
comprise a plant selectable marker (e.g. Kanamycin resistance) for convenience. 

Additionally, the construct may comprise a reporter gene active in plants (such as Green 
-Fluorescent-Protein, or-j8-gIuGuronidase - GUS") in operable linkage to a GAL4-responsive - 
upstream activation sequence (UAS), such that the reporter gene will be expressed in 
response to synthesis of the GAL4 DNA-binding domain/transcriptional activator fusion 
protein. Preferably the reponer gene is modified GFP as disclosed in WO 96/27675, or 
GUS (conveniently with a nuclear localisation signal). Preferably the UAS comprises one 
or more repeats of the 17mer to which GAL4 binds (Giniger et al, 1985 Cell 40, 767- 
774). 

As an alternative method of introducing the GAL4 binding domain/transcriptional activator 
sequence into a plant host cell genome, the construct may comprise Ds ("dissociation") 
elements, which may then be moved about the genome by the action of a transiently- 
expressed Ac (Activator) enzyme (e.g. Cocherel et al, 1996 Plant Mol. Biol. 30. 539- 
551). 

In a further aspect, the invention provides a plant or part thereof (e.g. a plant host cell or 
cell line) comprising the construct defined above. Typically the construct will have 
become integrated into a plant cell genome and be stably maintained therein. In particular 
^the_ invention provides a -plurality of -plants or parts thereof -(such as isolated plant-cells,- 
or cell lines, or tissue cultures) comprising a library, each plant or part thereof comprising 
a stably maintained nucleic acid sequence encoding an effective portion of the GAL4 
DNA-binding domain as defined above. Conveniently the nucleic acid sequence will be 
incorporated into the genome of plant cells present in the library. 

The library of plants or parts thereof thus provides a very useful resource. The library 
may be e.g. Arabidopsis rhaliana, or of any other plant which is routinely used in 
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research. Each plant or part thereof in the library may have a particular pattern of 
expression of the integrated reporter gene (see, for example, Sundaresan et aL, 1995 
Genes & Dev. 9, 1797-1810; Klimyuk et aL, 1995 Molec. Gen. Genet. 249, 357-365). 
Thus, introduction of a further gene, having a GAL4-responsive UAS into the cell line, 
will result in expression of the introduced gene in the same temporal/spatial pattern as the 
reporter gene, such that researchers can express a particular gene of interest in selected 
tissues and/or at selected times in a predictable manner. 

The invention thus provides a method of expressing a gene of interest in a known 
(predictable) pattern in a plant or part thereof, the method comprising: introducing the 
gene of interest into the plant or part thereof, said gene of interest having a GAL4- 
responsive upstream activation sequence (UAS), characterised in that said plant or part 
thereof comprises a reporter gene expressed in a known pattern under the influence of a 
transcriptional activator comprising an effective portion of a GAL4 DNA-binding domain 
encoded by a sequence in accordance with the invention defined above, such that binding 
of the transcriptional activator to the UAS causes transcriptional activation of the gene of 
interest. 

The temporal/spatial pattern of expression of the introduced gene or genes of interest will 
then mirror that already known for the reporter gene. Typically the plant or part thereof 
will be Arabidopsis thaliana, but may be any other plant which is routinely studied (e.g. 
maize, rice, potato etc). 

Typically the gene or genes of interest will be introduced by crossing GAL4-expressing 
plant cell lines with a plant cell line comprising the gene(s) of interest linked to a GAL4- 
responsive UAS. 

The nucleic acid sequence of the invention has other useful applications, in addition to use 
as an "enhancer trap", or to direct the expression of a gene or genes of interest in a 
predictable manner. The modified GAL4 DNA-binding domain/transcriptional activator 
may also be used to co-ordinate the expression of a plurality of genes of interest. In many 
circumstances it is desirable to express simultaneously a number of genes in a plant either 
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for research purposes, or for agricultural/industrial motives (e.g. to study metabolic 
pathways, or to manipulate the synthesis of desirable products such as plant dyes or 
lipids). 

Thus, the invention provides a method for co-ordinating the expression of a plurality of 
genes of interest in a plant or part thereof, each of the genes of interest being functionally 
associated with a GAL4-responsive UAS, characterised in that the plant or part thereof 
coinprises a sequence in accordance with the first _aspect of the invention defined above, 
and is capable of expressing a transcriptional activator comprising an effective portion of 
a GAL4 DNA-binding domain, such that binding of the transcriptional activator to the 
UAS causes simultaneous transcriptional activation of all of the genes of interest. 

Conveniently the plurality of genes of interest may all be associated, in a poiycistronic 
arrangement, with a single UAS, which facilitates their introduction into the plant or part 
thereof (the genes of interest typically being genes which are not naturally found in the 
plant or part thereof e.g. mammalian or bacterial genes, or genes from a different plant 
or possibly natural plant genes which have been the subject of various modifications). 
Alternatively one or more genes may be operably linked to a respective UAS. The 
-transcriptional activator may be encoded by a sequence already present in the plant or part 
thereof, or may be introduced simultaneously with (or subsequent to) introduction of the 
plurality of genes of interest. 

The invention will now be further described by way of illustrative examples and with 
reference to the accompanying drawings, of which: 

Figure 1 shows a nucleotide sequence in accordance with the invention encoding a 
modified GAL4 DNA-binding domain, fused in frame to a sequence encoding a modified, 
transcriptional activation domain from HSV VP16, the vertical line denotes the end of the 
GAL4 DNA-binding domain; 

Figure 2 is a schematic representation of a nucleic acid construct comprising a sequence 
in accordance with the invention; 
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Figure 3 shows the nucleotide sequence of part of the nucleic acid construct represented 
schematically in Figure 2; and 

Figures 4a-4d show micrographs of roots of plants comprising a sequence in accordance 
with the invention. 

Examples 

The inventors have used an approach similar to that described by Brand & Perrimon 
(1993, cited above), but adapted it foi: use in Arabidopsis, using Agrobacterium 
tumefasciens - mediated transformation to produce "enhancer trap" plant cell lines with a 
transcription activator comprising the GAL4 DNA-binding domain, fused to the activation 
domain of HSV VP16 (mGAL4-VP16). 

Example 1 - Construction of mGAL4-VP16 gene 

Eight oligonucleotides, which correspond to the modified sequence of the GAL4 DNA 
binding region, were synthesised. 

GAL-5' (28-mer) 

GGC AAC AAT GAA GCT ACT GTC TTC TAT C (Seq. ID No. 3) 
VP16-3' (31-mer) 

GGC AGA TCT ACC CAC CGT ACT CGT CAA TTC C (Seq. ID No.4) 
MGALl (109-mer) 

GGC AAG CTT GGA TCC AAC AAT GAA GCT CCT GTC CTC CAT CGA GCA 
GGC CTG CGA CAT CTG CCG CCT CAA GAA GCT CAA GTG CTC CAA GGA 
GAA GCC GAA GTG CGC CAA G (Seq. ID No, 5) 

MGAL2 (108-mer) 

TCC TCT CGA GGG AAG ATC AGG AGG AAG AGC TGC TCC AGG CGC TCC 
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AGG CGG GAG TCC ACT TGG GTG AGG TGG GCG CGG GTC AGC GGG GAG 
CGC TTG GTT TTG GGA GAG (Seq. ID No. 6) 

MGAL3 (81-mer) 

TTC COT CGA GAG GAG CTC GAG ATG ATC CTG AAA ATG GAG TCC CTC 
CAG GAC ATC AAA GCC CTG CTC ACC GGC CTC TTC GTC (Seq. ID No. 7) 

MGAL4 (89-mer) _ _ _ _ 

GGG TGA GGG GCA TGT CGG TCT CCA CGG AGG CCA GGC GGT CGG TGA 
CGG CGT CTT TGT TCA CGT TGT CCT GGA CGA AGA GGC CGG TGA GC (Seq. 
ID No. 8) 

MGAL5 (80-mer) 

GGA GAC CGA CAT GCC CCT CAC CCT GCG CCA GCA CCG CAT CAG CGC 
GAC CTC CTC CTC GGA GGA GAG CAG CAA CAA GGG CC (Seq. ID No. 9) 

MGAL6 (83-mer) 

AGT GGA GCT CGT CCC CCA GGC TGA CGT CGG TCG GGG GGG CCG TCG 
AGA CGG TCA ACT GGC GCT GGC CCT TGT TGC TGG TCT CC (Seq. ID No. 10) 

The full-length oligonucleotides were purified by electrophoresis in a 5 % polyacrylamide 
gel containing 7M urea and 90niM Tris-borate ImM EDTA pH8.3 (TBE) buffer. The 
fractionated oligonucleotides were detected by brief staining with filtered 0.05% toluidine 
blue dye, excised and eluted overnight at 37°C in 0.5M ammonium acetate. 0.1% SDS 
and 0. ImM EDTA. The eluted DNAs were precipitated and washed with ethanol, and 
phophorylated using T4 piynucleotide kinase. DNAs (0.5ug each) were separately 
resuspended in 50mM Tris-HCl pH 9.0, lOmM MgCl,, lOmM DTT with ImM ATP and 
5 units T4 polynucleotide kinase, and incubated for 30 mins at 37°C. 

The plasmid pCMVGal65 (Cousens et al, 1989 EMBO J. 8, 2337-2342) was used as a 
source of the GAL4-VP16 sequence with unmodified codon usage. Two restriction 
endonuclease fragments were isolated from the plasmid. A SacI-SacI fragment, which 
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contains the sequence encoding the DNA binding element of the Saccharomyces cerevisiae 
GAL4 protein, and a Sacl-Kpnl fragment, which encodes the activation domain of the 
herpes simplex virus VP 16 protein, were purified by fractionation and eiution from a low 
gelling temperature (LGT) 1% agarose gel. 

The 5' portion of the mGAL4-VP16 sequence was amplified by the polymerase chain 
reaction (PGR), using the GAL-5' and MGAL2 oligos as primers and the Sacl-Sacl GAL4 
DNA fragment as a template. Primers and template were incubated under standard 
conditions with VENT DNA polymerase (New England Biolabs) and subjected to 30 
cycles of 94^C for 30 sees, 50°C for 1 min, 72^C for 1 min. The product was cut with 
Xhol restriction endonuclease and purified after electrophoresis through a 1.5% LGT 
agarose gel. 

The MGAL4 and MGAL5 oligonucleotides were annealed by heating to 65 °C and slow 
cooling to room temperature. The MGAL3 and MGAL6 oligonucleotides were then added 
and also annealed to the mixture. The oligonucleoddes were then incubated with Klenow 
fragment of DNA polymerase 1 and T4 DNA ligase in 50mM Tris-HCI pH 7.5, lOmM 
MgCl2, lOmM DTT, ImM dNTPs and ImM ATP for 60 mins at 37°C. An excess.of 
MGAL3 and MGAL6 oligonuceotides were added, and the mixmre was subject to PGR 
amplification, as above. The product was cut with the restriction endonucleases Xhol and 
Sad, and purified after electrophoresis through a 1.5% LGT agarose gel. 

The three gel-purified restriction fragments: (1) the GAL-57MGAL2 PCR-amplified 
product, corresponding to the 5' portion of the GAL4 DNA binding domain, (2) the 
MGAL3-6 annealed and amplified product, corresponding to the 3' portion of the GAL4 
DNA binding domain, and (3) the Sacl-Kpnl fragment from pCMVGal65, corresponding 
to the VP 16 activation domain; were mixed and ligated with T4 DNA ligase in 50mM 
Tris-HGl pH 7.5, lOmM MgClj, lOmM DTT and ImM ATP at 20=*C overnight. The 
ligated product was PGR amplified (as above) using MGALl and VP16-3' as primers. 
The product, containing the entire modified CAL4'VPI6 sequence was cut v/ithBamHl and 
ligated to PBI121 (Jefferson etal, EMBO J. 6:3907, 1987) which had been cut with Sad, 
blunt ended by treatment with T4 DNA polymerase, and re-cut with BamHL. The 
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recombinant plasmid (pBIN 35S'mGAL4-VP16) contained the mGAL4-VP16 gene 
downsteam of the constitutive 35S plant promoter. 

In Figure 1, the lower DNA sequence is that of the fusion gene (Seq. ID No. 1), encoding 
in the 5' portion the GAL4 DNA-binding domain, and encoding in the 3' portion the 
transcriptional activation domain from HSV VP16. The wild-type sequence is shown 
above for comparison. It will be seen that the wild type GAL4 DNA-binding domain 
-coding sequence- is A/-T rich- (A/T% content is -shown. on the right for each- line-of- 
sequence), and it is believed that this causes inefficient expression in plants. In particular, 
it is believed that the A/T rich DNA comprises one or more regions which, when 
transcribed, are recognized as mRNA splice sites in plant cells. The altered nucleotides 
are shown outlined in Figure 1. The encoded polypeptide sequence is shown below (Seq. 
ID No. 2). The modifications to the nucleotide sequence were carefully selected to ensure 
that there was no resulting change in the encoded amino acid sequence. The numbers 
above the sequences represent the number of the nucleotide or amino acid at that position. 
Certain restriction endonuciease sites are also shown. 

The 35S promoter and mGAL4-VP16 gene were excised by digestion with the restriction 
endonucleases HinDlU and EcoRL, purified by 1% LGT agarose gel electrophesis, and 
subcloned into a vector containing a GAL4-responsive mgfpS-ER marker gene (see 
Example 2 below). The resulting construct is shown in Figure 2, in which R and L 
represent right and left Ti plasmid border sequence. The GAL4-VP16 fusion gene is 
downstream of the TATA-box region (nucleotides -48 to +1) of the CaMV 35S gene, 
were insened adjacent to the right Ti border. The plasmid also contains the kanamycin 
^resistancjs^ (Kan RX selectable marker,^ and a„reporter gene (GUS or GFP)_ operably linked ^ 
to a synthetic GAL4-UAS promoter sequence. Certain restriction endonuciease sites are 
also shown. 

Figure 3 shows the nucleotide sequence (Seq. ID No. 15) of the plasmid across the 
junctions of the Ti right border, the CaMV35S TATA box, and Lhe start of the GAL4 
DNA-binding domain coding region. 
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The mGAL4-VP 16 gene was directly assayed for activity in transformed Arabidopsis 
plants by Agrobacterium-TnG^diatod transformation (Valvekens et al Proc. Natl. Acad. Sci 
USA 85:5536-5540, 1988). 

Example 2 

Construction of GAL-GFP enhancer trap vector. 
Modification of the T-DNA right border. 

The plasmid pBIN19 (Bevan, M. Nuc. Acids Res. 12:8711-8721, 1984) was digested with 
the restriction endonuclease Sacll, and a 2.6 Kb fragment contaimng the Agrobacterium 
T-DNA right border sequence was subcloned into SacU cut pGEM 5Zf(Genbank accession 
No. X65308; commercially available from Promega). The bacteria containing the 
recombinant phagemid were superinfected with helper phage to produce single strand 
phagemid DNA. 

A synthetic mutagenic oligonucleotide ("TR-h": ATA TCC TGT CAA ACA CTG GAT 
CCG AGC TCC AAT TCA TAG TTF AAA CTG AAG GCG GG; Seq. ID No. 11) was 
kinased and annealed to the purified phagemid DNA, extended on the template nsin^ T7 
DNA polymerase, and ligated using T4 DNA ligase. The DNA was transformed into 
bacteria, and recombinant plasmids (pTr-f ) were screened for the insertion of new sites 
for the restriction endonucleases BamHI, Smal and EcoRl immediately adjacent to the T- 
DNA right border. 

Insertion of TATA box region. 

The TATA box region of the 35S promoter was PGR amplified from pBI121 using 
oligonucleotides complementary to the -48 region of the promoter (oligonucleotide 
A35S5^/II; GGC AGA TCT TCG CAA GAG CCT TCC TCT ATA TAA GG; Seq. ID 
No. 12) and the downstream )3-glucuronidase gene (oligonucleotide GVSSndBl; CAC ACA 
AAC GGT GAT ACG TA; Seq. ID No. 13). The PGR product was cut with restriction 
endonucleases Bgai and BamHl and ligated to BamUl cut pTr+. The resulting 
recombinant plasmid (pTr-hA35S) contained a minimal (naive) promoter positioned 
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adjacent to the T-DNA right border. 

Insertion of 35S promoter driven mGAL4-VP 16 gene. 

Derivatives of the GAL4 with unmodified codon usage were inserted into the pTr4-A35S 
plasmid in unsuccessful trial experiments, and one of these constructions (the insenion of 
a truncated GAL4 gene derived from pMA236; Ma <?/ al, 1988 Nature 334, 631-633) had 
resulted in the insertion of a unique HindlU site immediately adjacent to the BamHl site 
in pTR-f A35S. This plasmid, pTr+AGAL4, was digested with restriction endonucleases 
HindlU and EcoRl, and ligated with the HinDIU'EcoRL fragment from pBIN 35S-mGAL4- 
VP 16, to introduce the 35S-driven mGAL4-VP 16 gene. The Agrobacterium sequences 
from this plasmid (pTr-h35S-/?zG^4-VP 16) were then recloned into a binary plant 
transformation vector which contained a GAL4-responsive marker gene (see below), for 
testing in plant a. 

Construction of a synthetic GAL4- responsive promoter. 

A HinDUI'Xbal restriction endonuclease fragment containing five optimised binding sites 
for GAL4 was excised from the plasmid pUAST (Brand & Perrimon,. 1993 Development, 
118, 401-415), and ligated into HinDIU-Xbal cut pBIlOl (Jefeerson et aL, 1987 EMBO 
J. 6, 3901-3907) upstream of the GUS gene (to make pBINAUAS-GUS). A sequence 
corresponding to the -90 region of the 35S promoter w^as PGR amplified using 
oligonucleotides DELASl (GAA CTC TAG A AG CTA CTC CAC GTC CAT A AG GGA 
CAC ATC ACA ATC CCA CTA TCC TTC GC; Seq. ID No. 14) and GUSSndBl (see 
above). The product was Xbal-BamHl cut and cloned into similarly cut pBIN AUAS- 
GUS. The resulting plasmid (pBIN UAS(-90-ASl)GUS) contains a synthetic GAL4 
promoter upsteam of the jS-glucuronidase (GUS) coding sequence. The GUS gene_v\^as . 
then replaced with that of the green fluorescent protein (GFP). 

GAL4-responsive mgfp5-ER gene. 

We have extensively mutated the green fluorescent protein gene for optimised expression 
in plants. The optimised gene is called mgf]p5-ER (Siemering et al\ Current Biology 
6:1653-1663, 1996; W096/27675). 
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The mgfp5-ER gene has been cloned into the plant transformation vector pBI121, and we 
have excised a BamUl-Sacl fragment, which contains the coding sequence, and have 
insened this into BamHl-Sacl cut pBIN UAS(-90-ASl)GUS to replace the GUS gene. 
This plasmid (pBIN UAS(~90- AS I)mgfp5-ER) was cut with SacU, and ligated with the 
Sacll fragment containing the modified T-DNA right border and mGAL4-VPl6 gene from 
pTr+35S'mGAL4-VP]6, to give pBIN 35S'GAL4-VP]6'h VAS-mgfpS-ER (abbreviated to 
pBIN35S-GAL-GFP). 



Testing- of the pBIN35S-GAL-GFP vector. 

The pBIN 35S-GAL-GFP plasmid was introduced into Agrobacterium tumefaciens strain 
LBA4044 (Jefferson etaL^ EMBO J, 6:3901-3907, 1987) by electroporation. followed by 
selection on kanamycin-containing plates. Th^ Agrobacterium strain was used to transform 
Arabidopsis using the technique described below. Plants were scored for the GAL4- 
dependent induction of GPP fluorescence after transformation. 

Construction of GAL-GFP enhancer trap vector 

The pTR-i-35S-A72Gi4L^-V7'75 plasmid was treated with BamHI (to excise the 35S promoter 
sequences) and religated using T4 DNA ligase. This plasmid (pTR-\-mGAL4''VP16) 
contains the modified GAL4'VP16 positioned with a minimal promoter adjacent to the T- 
DNA right border. The T-DNA sequences were then excised by digestion with SacR, and 
ligated with similarly cut pBIN VASi'90-ASl)mgfp5-ER. This produced pBIN GAL-GFP, 
which contains the highly active mGAL4-VP16 transcription acdvator, responsive to 
adjacent enhancer elements, and a GAL4-dependent GFP gene. GFP fluorescence is 
produced in response to GAL4-VP16 activation. 

Enhancer trap screen. 

The enhancer trap vector pBIN GAL-GFP was introduced into Agrobacterium tumefaciens 
(see details below) LBA4044, and has been used to generate over 7,500 u^nsformed 
Arabidopsis seedlings. Transgenic seedlings have been screened directly for GAL4- 
mediated GFP expression, and seeds were collected from expressing plants. We have a 
library of over 250 Arabidopsis lines which show stable inheritable patterns of GAL4- 
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mediated GFP expression in the root. 
Transactivation. 

To demonstrate GAL4-mediated transactivation of a foreign gene we have generated 
trsLnsg^nicArabidopsisxh^t contain the T-DNA sequences from pBIN UAS(-90-ASl)GUS). 
In the absence of GAL4-VP16, these plants do not express the GUS gene. However, 
when genetically crossed with lines expressing the mGAL4-VP16 gene, the GUS gene is 
activated in a pattern of expression which accurately reflects that of the mGAL4-VP16 gene 
from the donor parent (see Example 3 below). 

Example 3 

A stable transformed line (J2302) of Arabidopsis thaliana, which formed part of the 
library described above, expressed modified GFP (under the influence of the mGAL4- 
VP16 activator) in the cells of the extreme root tip (see Figures 4a and 4b). 

Figure 4a is a low power micrograph showing GFP-mediated epifluorescence (400nm 
excitation) at the root tip. Figure 4b is a higher magnification confocal micrograph 
(488nm excitation) showing the area of GFP-mediated fluorescence in greater detail. 

The line J2302 was crossed (using standard techniques) with another Arabidopsis line 
which comprised a stable, silently-mamtained GUS reporter gene operably linked to a 
GAL4-responsive UAS. The product of the cross, as expected, proceeded to express 
GUS, under the influence of the GAL4-VP16 transcriptional activator, in the same pattern 
as the GFP reponer gene in J2302 (i.e. at the extreme root tip). Suitably stained samples, 
imaged by brightfield microscopy, are shown in Figures 4c and 4d, and demonstrate GUS 
reporter gene activity (darkest staining) at the root tip. 

This illustrates that the modified GAL4 DNA-binding domain sequence can 

(i) successfully be used in the construction of enhancer trap vectors; 

(ii) express genes of interest (as exemplified by GUS) in a predictable pattern; and 

(iii) co-ordinate the simultaneous expression of a plurality of genes of interest (as 
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exemplified by GFP and GUS). 
Example 4 

Transformation of Arabidopsis thaliana. 

The methods employed are based on those originally disclosed by Valvekens et aL 1988. 
Proc. Natl. Acad Sci, 85, 5536-5540). 

Media 

Unless otherwise stated all procedures were carried out asepdcally using sterile solutions 
and equipment. All media are used in standard plastic disposable petri dishes or, for later 
stages. Magenta OA 7 pots (Magenta Corp., USA). Hormones are dissolved in dimethyl 
sulfoxide (DMSO) as xlOOO stocks. Hormones and antibiotics are added after autoclaving 
and cooling of the media to 65°C. 

1. Germination medium (GM) comprised: Ix Murashige and Skoog salt mixture; 1% 
sucrose; 100 mg/1 inositol; 1.0 rag/I thiamine; 0.5 mg/I pyridoxine; 0.5 mg/1 nicotinic 
acid; 0.5 g/1 2-(N-morpholino)ethanesulfomc acid (MES); (adjust to pH 5.7 with IM 
KOH); and 0.8% Difco Bacto agar (for solid GM medium). 

la. GM K50 

As GM, but supplemented with: 50 mg/1 kanamycin (Sigma) 

2. Callus-inducing medium (CIM) comprised: Ix Gamborg's B5 medium; 2% glucose; 
0.5 g/I MES (pH 5.7, adjusted with IN KOH); 0.5 mg/I 2,4-D (Sigma); 0.05 mg/1 
kinetin (Sigma); and 0.8 % agar (for solid CIM medium). 

3. Shoot- inducing medium with Vancomycin (SIM V750 K50) comprised: Gamborg's B5 
medium; 2% glucose; 0.5 g/1 MES (pH 5.7); 0.8% agar; 5 rag/I N6-(2- 
isopentenyI)adenine (2ip); 0,15 mg/1 indole-3-acetic acid (lAA); 750 mg/1 vancomycin; 
and 50 mg/1 kanamycin. 



3a. SIM V500 K50 
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As SIM, but supplemented with: 500 mg/I vancomycin: and 50 mg/1 kanamycin. 

4, Root-inducing medium (RIM) comprised: Gamborg*s B5 medium; 2% glucose: 0.5 g/1 
MES (pH 5.7); 0.8% agar; 12.5 mg/1 indole butyric acid (IBA); and 50 mg/1 cefotaxime. 

Growth of plants 

(i) Place seeds into a 15 ml polypropylene centrifuge tube. 

(ii) Add ethanol for 2 min. Remove ethanol with pipette. . 

(iii) Replace with 5% commercial bleach ( — 0.25% available chlorine) containing one drop 
of NP40 per 50 ml. Leave for 15 min, shaking regularly. 

(iv) Wash seeds in sterile, distilled water for at least three times. 

(v) After last wash, add the sterile seeds to 100ml of GM in a conical flask. Grow with 
shaking for 2-4 weeks in a culture room at 20-25'*C. 

Agrob acteriuutn-mediated transformation. 

1, Day One 

(i) Using a scalpel, cut-off all the green (upper) parts of the plantlets to leave only the root 
systems. 

(ii) Lay out the roots on plates containing solid CIM. Gently press down on each bunch 
of roots to ensure that they are in contact with the surface of the agar. 

(iii) Incubate for 3 d in growth room (22°C, continual light). 

2. Day Four 

(i) After ensuring that there is no visible sign of contamination, collect the callus-induced 
roots in an empty petri dish. 

(ii) Cut the roots into 0.5 cm explants. 

(iii) Add 3-5 ml of Agrobacterium culture which has been grown overnight at 28oC in 
Luria broth, and washed by centrifugation. Swirl with blunt-nosed forceps. Leave to co- 
cultivate for 2min. 

(iv) Blot the roots dry on double-thick sterile filter-paper ( Whatman No.l) in a petri dish. 

(v) Place the explants onto solidified CIM medium in a petri dish, and gently press them 
onto the surface to ensure contact with the medium. 
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(vi) Incubate plates in the growth room for 2 d to allow co-cultivation of Agrobacterium. 
3. Day Six 

(i) Transfer the explants to a petri dish in 20-25 ml of water. 

Agitate with blunt-nosed forceps to wash off the agrobacteria. The bathing medium should 
become somewhat mrbid. Transfer the root pieces to a sieve and repeat the washing. Then 
lift the sieve out of buffer and drain. Press down the explants down onto the mesh to 
remove as much buffer and agrobacteria as possible. Push the semi-dried explants together 
into a mass. 

(Custom-made, autoclavable and re-usable, sieves are used to hold the explants during 
Agrobacterium infections and washings of root explants. These sieves are made from 100 
ml plastic three-cornered beakers and 100 um nylon mesh. The top 3-4 cm is cut off the 
beaker and the mesh held in place across the bottom of this piece by pushing a ring 2 cm 
high cut from just above the base of the beaker into the lower portion of the top part. 
The two pieces are sealed together by pushing a hot metal rod through them in several 
places.) 

(ii) Blot small bundles of root material on double-thick sterile filter-paper in a petri dish 
until dry and transfer to solidified SIM V750 K50 medium, taking care that root explants 
are in close contact with the medium. 

(iii) Incubate in growth room. 

Regeneration 

(i) In growth room, tiny green kanamycin resistant calli appear on the yellowish root- 
explants after 3 weeks, 

(ii) Every 2 weeks, transfer explants to fresh SIM V750 K50. Shoots (often initially 
vitreous) interminently appear from the green calli over the next several weeks. 

(iii) Transfer these shoots to RIM, where roots will develop over a period of 2 to 3 weeks 

(iv) After the formation of roots, transfer the plantlets to GM in vented. Magenta GA7 pots 
or directly to soil. Ensure that the humidity is kept low to ensure good seed set.- 

(v) Harvest seed pods when they turn yellowish-brown and are therefore mature. 

Germination and screening of Fl progeny. 
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To test for transformation, Fl seedlings are germinated on GM K50. 

(i) Put seeds on GM K50 (surface sterilise the seeds if necessary). 

(ii) Put peiri dishes in the dark at 4oC (refrigerator) for 3-5 d, to break seed dormancy. 
This is not necessary if seeds were stored for more than a month. 

(iii) Incubate petri dishes in growth room for 2 weeks. Sensitive seedlings form neither 
roots nor leaves, and have white cotyledons. Transformed seedlings are phenotypically 
normal. 

(iv) Transgenic callus and shoots were screened for GFP expression using a inverted 
fluorescence microscope (Leitz DM-IL) fitted with a filter set (Leitz-D excitation BP355- 
425, dichroic 455, emission LP460 ) suitable for the main 395nm excitation and 509nm 
emission peaks of GFP. The use of a 7mm threaded extension mbe with a 4x objective 
(EF 4/0.12) gave a greater working distance, and has allowed the convenient direct 
observation of tissue within inverted sealed petri dishes. A 100 Watt long wavelength 
hand-held UV lamp (UV Products, BIOOAP) was also used for routine monitoring of 
transgenic shoots and plants. Transgenic Arabidopsis Fl seedlings were grown in sterile 
agar culture for 5 days, and were mounted in water under glass coverslips for microscopy. 
The specimens were examined using a BioRad MRC-600 laser-scanning confocal 
microscope equipped with a krypton-argon laser and filter sets suitable for the detection 
of fluorescein and texas red dyes (BioRad K1/K2), and a Nikon 60x PlanApo N.A. 1.2 
water immersion objective. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: BTG INTERNATIONAL LIMITED 
(ii) TITLE OF INVENTION: IMPROVEMENTS IN OR RELATING TO GENE 

EXPRESSION 
(iii) NUMBER OF SEQUENCES: 15 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: FETHERSTONHAUGH & CO. 

(B) STREET: P.O. BOX 2999, STATION D 
10 (C) CITY: OTTAWA 

(D) STATE: ONT 

(E) COUNTRY: CANADA 

(F) ZIP: KIP 5Y6 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: ASCII (text) 
(vi) CURRENT APPLICATION DATA: 

2 0 (A) APPLICATION NUMBER: CA 

(B) FILING DATE: 14 -FEB- 1997 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9603069.7 

(B) FILING DATE: 14-FEB-1996 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: FETHERSTONHAUGH £c CO. 

(B) REGISTRATION NUMBER: 

(C) REFERENCE/DOCKET NUMBER: 23410-576 

3 0 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (613) -235-4373 

(B) TELEFAX: (613 ) -232-8440 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 701 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17. .694 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AAGCTTGGAT CCAACA ATG-AAG CTC-CTG-TCC ^TCC ATC GAG-CAa GCC TGC - 49, 

10 Met Lys Leu Leu Ser Ser lie Glu Gin Ala Cys 

1 5 10 

GAC ATC TGC CGC CTC AAG AAG CTC AAG TGC TCC AAG GAG AAG CCG AAG 97 
Asp lie Cys Arg Leu Lys Lys Leu Lys Cys Ser Lys Glu Lys Pro Lys 
15 20 25 

TGC GCC AAG TGT CTG AAG AAC AAC TGG GAG TGT CGC TAC TCT CCC AAA 145 

Cys Ala Lys Cys Leu Lys Asn Asn Trp Glu Cys Arg Tyr Ser Pro Lys 

30 35 40 

20 

ACQ AAG CGC TCC CCG CTG ACC CGC GCC CAC CTC ACC GAA GTG GAG TCC 193 

Thr Lys Arg Ser Pro Leu Thr Arg Ala His Leu Thr Glu Val Glu Ser 
45 50 55 

CGC CTG GAG CGC CTG GAG CAG CTC TTC CTC CTG ATC TTC CCT CGA GAG 241 
Arg Leu Glu TVrg Leu Glu Gin Leu Phe Leu Leu lie Phe Pro Arg Glu 
60 65 . 70 75 

GAC CTC GAC ATG ATC CTG AAA ATG GAC TCC CTC CAG GAC ATC AAA GCC 289 
3 0 Asp Leu Asp Met lie Leu Lys Met Asp Ser Leu Gin Asp lie Lys Ala 

80 85 90 

CTG CTC ACC GGC CTC TTC GTC CAG GAC AAC GTG AAC AAA GAC GCC GTC 337 
Leu Leu Thr Gly Leu Phe Val Gin Asp Asn Val Asn Lys Asp Ala Val 
95 100 105 

ACC GAC CGC CTG GCC TCC GTG GAG ACC GAC ATG CCC CTC ACC CTG CGC 3 85 

Thr Asp Arg Leu Ala Ser Val Glu Thr Asp Met Pro Leu Thr Leu Arg 
110 115 120 

40 

CAG CAC CGC ATC -AGC GCG ACC-TCC TCC TCG-GAG GAG AGC-AGC AAC -AAG -43-3 

Gin His Arg lie Ser Ala Thr Ser Ser Ser Glu Glu Ser Ser Asn Lys 

125 130 135 

GGC CAG CGC CAG TTG ACC GTC TCG ACG GCC CCC CCG ACC GAC GTC AGC 4 81 

Gly Gin Arg Gin Leu Thr Val Ser Thr Ala Pro Pro Thr Asp Val Ser 
140 145 150 155 

CTG GGG GAC GAG CTC CAC TTA GAC GGC GAG GAC GTG GCG ATG GCG CAT 529 
50 Leu Gly Asp Glu Leu His Leu Asp Gly Glu Asp Val Ala Met Ala His 

160 165 170 

GCC GAC GCG CTA GAC GAT TTC GAT CTG GAC ATG TTG GGG GAC GGG GAT 577 
Ala Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu Gly Asp Gly Asp 
175 180 185 

TCC CCG GGG CCG GGA TTT ACC CCC CAC GAC TCC GCC CCC TAC GGC <3CT 625 
Ser Pro Gly Pro Gly Phe Thr Pro His Asp Ser Ala Pro Tyr Gly Ala 
190 195 200 
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10 



30 



CTG GAT ATG GCC GAG TTC GAG TTT GAG CAG ATG TTT ACC GAT GCC CTT 673 
Leu Asp Met Ala Asp Phe Glu Phe Glu Gin Met Phe Thr Asp Ala Leu 
205 210 215 

GGA ATT GAG GAG TAG GGT GGG TAGATCT 701 
Gly lie Asp Glu Tyr Gly Gly 
220 225 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Leu Leu Ser Ser lie Glu Gin Ala Cys Asp lie Cys Arg Leu 
20 1 5 10 15 

Lys Lys Leu Lys Cys Ser Lys Glu Lys Pro Lys Cys Ala Lys Cys Leu 
20 25 30 

Lys Asn Asn Trp Glu Cys Arg Tyr Ser Pro Lys Thr Lys Arg Ser Pro 
35 40 45 



Leu Thr Arg Ala His Leu Thr Glu Val Glu Ser Arg Leu Glu Arg Leu 
50 55 60 

Glu Gin Leu Phe Leu Leu lie Phe Pro Arg Glu Asp Leu Asp Met lie 
65 70 75 80 

Leu Lys Met Asp Ser Leu Gin Asp lie Lys Ala Leu Leu Thr Gly Leu 
85 90 95 

Phe Val Gin Asp Asn Val Asn Lys Asp Ala Val Thr Asp Arg Leu Ala 
100 105 110 

4 0 Ser Val Glu Thr Asp Met Pro Leu Thr Leu Arg Gin His Arg lie Ser 

115 120 125 

Ala Thr Ser Ser Ser Glu Glu Ser Ser Asn Lys Gly Gin Arg Gin Leu 
130 135 140 

Thr Val Ser Thr Ala Pro Pro Thr Asp Val Ser Leu Gly Asp Glu Leu 
145 150 155 160 

His Leu Asp Gly Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp 
50 165 170 175 

Asp Phe Asp Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly 
180 185 190 

Phe Thr Pro His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp 
195 200 205 



60 



Phe Glu Phe Glu Gin Met Phe Thr Asp Ala Leu Gly lie Asp Glu Tyr 
210 215 220 



Gly Gly 
225 



(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

- - - (xi)- SEQUENCE- DESCRIPTION: -SEQ- ID NO: -3: _ _ 

GGCAACAATG AAGCTACTGT CTTCTATC 

(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGCAGATCTA CCCACCGTAC TCGTCAATTC C 

(2) INFORMATION FOR SEQ ID NO : 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 base pairs 

(B) TYPE: nucleic acid 

- iC) -STRANDEDNESS-: single- - - - - - - - - „ - 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGCAAGCTTG GATCCAACAA TGAAGCTCCT GTCCTCCATC GAGCAGGCCT GCGACATCTG 
CCGCCTCAAG AAGCTCAAGT GCTCCAAGGA GAAGCCGAAG TGCGCCAAG 



(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 8 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TCCTCTCGAG GGAAGATCAG GAGGAAGAGC TGCTCCAGGC GCTCCAGGCG GGACTCCACT 
TCGGTGAGGT GGGCGCGGGT CAGCGGGGAG CGCTTGGTTT TGGGAGAG 

(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 
TTCCCTCGAG AGGACCTCGA CATGATCCTG AAAATGGACT CCCTCCAGGA CATCAAAGCC 
CTGCTCACCG GCCTCTTCGT C 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGGTGAGGGG CATGTCGGTC TCCACGGAGG CCAGGCGGTC GGTGACGGCG TCTTTGTTCA 
CGTTGTCCTG GACGAAGAGG CCGGTGAGC 

(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGAGACCGAC ATGCCCCTCA CCCTGCGCCA GCACCGCATC AGCGCGACCT CCTCCTCGGA 
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GGAGAGCAGC AACAAGGGCC 80 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHTUIACTERISTICS : 

(A) LENGTH: 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

_ (xi) ^SEQUENCE-DESCRIPTION : _SEQ_ ID_ NO : -10 : _ __ _ „ _ ^_ 

10 AGTGGAGCTC GTCCCCCAGG CTGACGTCGG TCGGGGGGGC CGTCGAGACG GTCAACTGGC 60 

GCTGGCCCTT GTTGCTGCTC TCC 83 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
2 0 ATATCCTGTC AAACACTGGA TCCGAGCTCC AATTCATAGT TTAAACTGAA GGCGGG 56 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 5 base pairs 
- ^ (B)-- TYPE nucleic acid- „ ^ _ . ._ _ _ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGCAGATCTT CGCAAGACCC TTCCTCTATA TAAGG 3 5 
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(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 



( C ) STRANDEDNES S : s i ngl e 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CACACAAACG GTGATACGTA 



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GAACTCTAGA AGCTACTCCA CGTCCATAAG GGACACATCA CAATCCCACT ATCCTTCGC 



(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TAGGTTTACC CGCCAATATA TCCTGTCAAA CACTGGATCT TCGCAAGACC CTTCCTCTAT 
ATAAGGAAGT TCATTTCATT TGGAGAGGAC A 



CA 02246414 1998-07-31 



F . 1 .1^ i^fl'w.,!': si-ec*. \Ev5<:s4 : 7.0OC 

27 

CLAIMS 

1. A niicieic acid sequence, expressible in a plant cM, encoding at least tin 
effective portion of a GAL4 DNA-binding domain, the sequence having a % 
A/T base content substantially reduced relative to the vvild-iype yeast sequence, 
characterised in tliat the % A/T base content ts less tiian 45%. 

2. A nucleic acid sequence is claimed in Claim 1 characterised in tlial the % A/T 
"basc content is less than"40%. " " - - - - - . 

3. A sequence according to Claim 1 or Claim 2, further comprising a structural 
and/or regulatory sequence in operable linkage to the sequence encoding the 
effective portion of the GAL4 DNA-binding domain, 

4. A sequence riccordiiig to any one of Claims 1 to 3 wherein the nucleic acid 
encodes a peptide or polypeptide, having Cranscriptional aetisation activity, in 
operable linkage with the sequence encoding the effective portion of the GAL4 
DNA-binding domain. 

5. A sequence according to any one of Claims 1 lo 4; wherein the sequence 
encodes a peptide or polypeptide, having transcriptional activaiton activity in a 
plant cell, which is a modified portion of GAL4, VPA16, GCN4 or a designed 
sequence which does not naturally occur. 

6. A nucleic acid construct comprising a sequence in accordance with aiiy one of 
claims 1 to 5, 



A construct according to Claim 6, capable of inserting a sequence according to 
any one of Claims 1 to 5 into the genome of a plant host cell. 

\ construct according to Claim 6 or 7, further comprising a reporter gene in 
operable linkage with a CiAr,4-responsive upstream activation sequence (UAS). 
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A construcl nccording to any one of Ch^ims 6, 7 or 8, cnnprising a sequence 
encoding GL'S or green fluorescent protein (GFP) suirable for expression in a 
plant cell. 

A construct according u> any one of Claims 6 to 9, comprising l-.DNA borders 
or Ds elements. 

A construct according to any one of Clnims fi-to 10. ^vhcroin expression of the 
sequence according or Claims 1 to 4 is under ihe control of an enhancer- 
dependent. plant promoter. 

A construct according to any one of Claims 6 to 11. further comprising a 
selectable n^arker gene. 



A plant or part thereof compnsing a nucleic acid sequence according to any 
of Claims 1 to 5, or tran.sfom^ed witli a construct iiccording to any one of Ci 
6 lo 12. 



one 
aims 



A plant or part thereof according to Claim 13, ^vlicrcin the sequence is stably 
maintained in the plant gcnonie. 

A iibi-ary comprising a plurality of planus or parts thereof, each plant or part 
(hereof comprising a stably integrated nucleic acid sequence accordmg to Claim 
4 or Claim 5, which sequence causes the expression of a reporter gene in a 
known temporal and/or .spatial pattern. 

A library according to Claim 15. made by the inU-oduction of a nucleic acid 
construct according to any one of Claims 7 to 12 into a plurality of plants or 
parts Ihoreof. 

A library according lo Claim 15 or 16 comprising a plurality of Arabidopsis 
(hitliuna plants. 
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A method of expressing a gene of inlcre^>t in a known pattern in a plani or part 
Ihereof, the nicthtxl comprising: inlroducing ihc- gene or intercut into Ihe plant or 
part thereof, said gene of intorest having a CtAL4- responsive upslreani 
activation sequence (UAS), chaiactcnscd in rliat .said plant or part thereof 
comprises a reporter gene cxpre<;$ed in a known pattern iirider the itifluence of a 
tran.scnplional activator comprising an effect jye_portion_ of a GAL4 DNA- 
binding domain encoded by a sequence according to any one of Claims 1 to 5, 
such that binding of the transcriplional activator to the UAS causes 
- transcriptional aclivationof the geneofintcrcstr 

A method according to Claim 18, wherein the gene of interest is introduced into 
one or more plants or parts thereof from a library according to any one of 
Claims 15, 16 or 17. 

A method for co-ordinating the expression of a plurality of genes of interest in a 
plant or part ll:iereof, each of the genes of intcrebl being functionally associated 
with a GAL4-rcsponsivc UAS, charaelerised in tiiat the plant or part thereof 
comprises a sequence according to any one of Claims 1 lo 5 and is capable of 
expressing a Irun^icriptional aciivutor comprising an effective portion of a GAL4 
DNA-binding domain, such that binding of the transcriptional activator to the 
UAS causes sinmltanccits transcriptional aclivution of all of the genes of 
interest. 

A melhvxi according to Claim 20, wherein at least some of the genes of interest 
arc arranged iX)Iycistronically and associated with a single GAL4-respon3ivc 
UAS, 

A method according to Claim 20, wherein genes of interest arc functionally 
associated with respective GAL4-rcspon5ivc UASs. 
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Modified Ti plasmid right border 

TAGGTTTACCCGCCAATATATCXrrGTaiAACACT GGATCTTCGCAAGACCCTT 
I 25 bp repeat j 

mRNA start 
I 

CCTCTATATAAGGAAGTTCATTTCATTTGGAGAGGACA. .GAL4- gene 
TATA Box 



Fig. 3 
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UAS-nGUS 



X»gluc stAined 
brighcfieJd 



Fig. 4a 
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Fig. 4c 
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Fig. 4b 



Fig Ad 
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